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Abstract — The differential encoding/decoding setup introduced 
by Kiran et al, Oggier-Hassibi and Jing-Jafarkhani for wireless 
relay networks that use codebooks consisting of unitary matrices 
is extended to allow codebooks consisting of scaled unitary 
matrices. For such codebooks to be usable in the Jing-Hassibi 
protocol for cooperative diversity, the conditions involving the 
relay matrices and the codebook that need to be satisfied are 
identified. Using the algebraic framework of extended Clifford 
algebras, a new class of Distributed Differential Space-Time 
Codes satisfying these conditions for power of two number of 
relays and also achieving full cooperative diversity with a low 
complexity sub-optimal receiver is proposed. Simulation results 
indicate that the proposed codes outperform both the cyclic 
codes as well as the circulant codes. Furthermore, these codes 
can also be applied as Differential Space-Time codes for non- 
coherent communication in classical point to point multiple 
antenna systems. 

Index Terms — Algebra, cooperative diversity, low decoding 
complexity, space-time coding. 



I. Introduction 

CONSIDER the scenario of R + 1 users wanting to 
communicate to a single destination using cooperative 
strategies. For simplicity, we consider one of the R+l users as 
the source and the remaining R users as relays. Predominantly 
there are two types of relaying strategies discussed in the 
literature, i.e., 1) amplify and forward and 2) decode and 
forward. In this work, we focus only on amplify and forward 
based relaying strategies wherein the relays are allowed to 
perform linear processing of the received signal. Amplify 
and forward based relaying is simpler and moreover does 
not require the relays to have knowledge of the codebook 
used by the source or the knowledge of the source to relay 
channel fading gains. Recently in (TJ, the idea of space- 
time coding for collocated MIMO (Multiple Input Multiple 
Output) systems has been applied in the setting of cooperative 
wireless relay networks in the name of distributing space- 
time coding, wherein coding is performed across users and 
time. This strategy provides each user a diversity order equal 
to the number of relays even though all the users are only 
equipped with a single antenna. The diversity thus achieved 
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is called as cooperative diversity. However, such strategies 
require that the destination have complete knowledge of the 
fading coefficients from all the users to itself as well as 
that of the fading coefficients between users. Obtaining the 
knowledge of the fading coefficients between the users at 
the destination requires additional resources. To solve this 
problem, in 0, Kiran et al have proposed a differential 
encoding/decoding setup for cooperative wireless networks 
that does not require the knowledge of fading coefficients 
between the users at the destination. Such codes were named 
as partially coherent distributed space-time codes in 0. In a 
recent work it has been shown that the same strategy of 
offers full diversity with a suboptimal receiver that does 
not require the knowledge of any of the fading coefficients. In 
an independent work [3|, Jing and Jafarkhani have proposed a 
differential encoding/decoding setup for cooperative wireless 
relay networks which is more general than the setup proposed 
in 0, [4 | and they have also provided few code constructions. 
We call the class of Differential Space-Time Codes (DSTCs) 
which can be used in a distributed manner for cooperative 
diversity as Distributed DSTCs (and denote by DDSTCs) to 
differentiate them from DSTCs for collocated MIMO systems. 
The problem of designing DDSTCs is more challenging than 
that of DSTCs for collocated MIMO systems, since in this 
scenario we have additional constraints to be satisfied which 
are due to the cooperative diversity protocol. In [5], the setup 
of 0, has been generalized to the case when the source and 
destination nodes have multiple antennas. In 0, the authors 
propose the use of a random diagonal unitary matrix codebook 
in general and for certain specific number of relays (3, 6 and 
9) propose the use of cyclic codes. However, a random code is 
not guaranteed to offer full diversity. Moreover designing good 
cyclic codes becomes difficult for large number of relays. In 
101 , the authors propose the use of Alamouti code and Sp(2) 
code for the specific case of 2 and 4 relays. For an arbitrary 
number of relays, the authors construct fully diverse DDSTCs 
based on circulant matrices. But, except for the Alamouti 
code all other DDSTC constructions in the literature 0,0, 
llH, have large decoding complexity. Thus, a general full 
diversity code construction targeting the requirements of low 
encoding complexity as well as low decoding complexity is not 
available. This issue gains significant importance especially if 
the number of cooperating terminals is large, which is quite 
expected in applications like wireless sensor networks. 

In this paper we address this issue and present the following 
contributions: 

• The notion of encoding complexity for Space-Time codes 
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in terms of g-group encodability is introduced (Definition 
Q]) and its significance is highlighted. 

• The differential encoding/decoding setup introduced by 
Kiran et al 0, Oggier et al [5| and Jing et al [3| for 
wireless relay networks that use codebooks consisting 
of unitary matrices is generalized to allow codebooks 
consisting of scaled unitary matrices. This generalization 
aids in reducing the encoding/decoding complexity of 
DDSTCs. When the codebook of scaled unitary matrices 
is used in the Jing-Hassibi protocol [1] for differential 
encoding at the source, the extra conditions involving 
the interrelationship between relay matrices and the code- 
word matrices that need to be satisfied are identified (Eqn. 
©)■ 

• Using the algebraic framework of extended Clifford al- 
gebras, a new class of DDSTCs satisfying these extra 
conditions (Constructions [431 and |4.4| > is proposed which 
achieves full diversity with a low complexity sub-optimal 
receiver (Theorem [3]). Explicit construction of multidi- 
mensional signal sets that lead to full diversity (Theorem 
|2|l are also provided. To the best of our knowledge, this is 
the first known low decoding complexity DDSTC scheme 
for cooperative wireless networks which is available for 
all power of two number of relays. 

A. Organization of the paper 

Section [TT] introduces the system model for the distributed 
differential encoding/decoding setup employing a differential 
scaled unitary matrix codebook at the source. In Section [HI] 
the notion of g-group encodable and g-group decodable space- 
time codes is discussed for the system model of Section [II] 
and the problem statement for DDSTCs with low encoding 
and decoding complexity is given. The extra conditions on the 
code structure imposed by the cooperative diversity protocol 
of (2, 0, 0, are then described. In Section [IV] we 
explicitly construct a family of DSTCs using extended Clifford 
algebras and prove using algebraic techniques that they satisfy 
the extra conditions needed for DDSTCs. Simulation results 
and discussion on further work comprise Sections [V] and [VI] 
respectively. 

B. Notation 

For two sets stf\ and sfo, sd\ x denotes the Cartesian 
product of si\ and =2/2- For a matrix C, the matrices C*, 
C T and C H denote the conjugate, transpose and conjugate 
transpose of C respectively. For a complex number x, xi 
and xq denote its in-phase and quadrature-phase components 
respectively. For a vector a, a[i] denotes the i th entry of a. 
An all zero matrix of appropriate size will be denoted by 0. 
A complex Gaussian vector with zero mean and covariance 
matrix tt will be denoted by £A/"(0,fi). 

II. System model employing differential scaled 

UNITARY MATRIX CODEBOOK 

In this section, we briefly explain the distributed differential 
encoding/decoding setup proposed in [2|, [3|, |5| with a slight 
modification. 



We consider a network consisting of a source node, a 
destination node and R other relay nodes which aid the source 
in communicating information to the destination. All the nodes 
are assumed to be equipped only with a single antenna and 
are half duplex constrained, i.e., a node cannot transmit and 
receive simultaneously at the same frequency. The wireless 
channels between the terminals are assumed to quasi-static and 
flat fading. The channel fading gains from the source to the i- 
th relay, /j and from the j-th relay to the destination gj are all 
assumed to be independent and identically distributed complex 
Gaussian random variables with zero mean and unit variance. 
Moreover, we assume symbol synchronization among all the 
nodes. 

Every transmission cycle from the source to the destination 
comprises of two stages. In the first stage, the source transmits 
a T(T > R) length vector ^/tt^TPs which the relays receive. 
Here, P denotes the total average power spent by all the relays 
and the source. The fraction of total power P spent by the 
source is denoted by k\. The vector s satisfies E[s ff s] = 1 
and is a function of the information that the source intends 
to communicate. The exact relation of s with the information 
sent by the source will be made precise in the sequel. The 
received vector at the j-th relay node is then given by 
r-j = y/irxTPfjS + Vj, with vj — CN{0,It)- In the second 
half of the cycle, all the relay nodes are scheduled to transmit 
together. The j-th relay node transmits a T length vector tj 
which is a function of rj. The relays are only allowed to 
linearly process the received vector rj or its conjugate rj*. 
To be precise, the j-th relay node is equipped with a T x T 
unitary matrix Aj (called relay matrix) and it transmits either 

tj = <fiSfa*i*i or ^ = /SSiAji-j*. Without loss of 
generality, we may assume that M relays linearly process rj 
and the remaining R — M relays linearly process rj * . Here, 
ir-2 denotes the fraction of total power P spent by a relay. If 
the quasi-static duration of the channel is much greater than 
2T time slots, then the received vector at the destination is 
given by 

where, 

X = [ A x s . . . A M s A M +is* . . . A R s* ] , 

h = [ Agi ■ ■ ■ Im9m /m+i3m+i •■■ Ir9r] 1 

n = y / 5?T (E"i 9j -Ajvj + Ef=M+i SjAjVj*) + w 

and w ~ CjV(0, It) represents the additive noise at the 
destination. The power allocation factors 7Ti and 7T2 are chosen 
in such a way to satisfy -K\P + 1T2PR = IP. Throughout this 
paper, we chose m = 1 and 7r 2 = i as suggested in (TJ. 

The differential encoding is performed at the source as 
follows. An initial vector so known to the destination which 
satisfies so^so = 1 is transmitted by the source in the first 
cycle. The initial vector so should be such that the initial 
matrix 

X = [ Ais . . . A M So A M +iS * ■ • ■ A R s * ] 

(2) 
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is unitary. The transmitted vector at the i-th cycle is then given 
as follows 

s t = U t s t _i (3) 

at-i 

where, Ut £ 'tf is the codeword containing the information 
at the t-th cycle which satisfies Ut^Ut = o^It, at G 
The codebook c € at the source consisting of scaled unitary 
matrices should satisfy E[af] = 1 in order to meet the 
power constraint at the source and relays. The originally 
proposed coding strategies in 10, El, [5 1 force a t = 1 for all 
codewords. The received vector at the destin ation in the t-th 
cycle can then be written as y t = ^ j^p+^p -X-t ht + n t where, 
X t = [ Ais t ... A M s t A M +iS t * ... A R s t *].If 



AiU t = U t A i; V U t _ c <f, i = 1, . . . , M and 
A ; Ut* = U t A i; V U t S <€, i = M + 1, . . . , R 



(4) 



then, substituting for St from © we get X t = ^_U t Xt-i. 
If the channel remains approximately constant for 4T channel 
uses, then we can assume that ht = ht-i. Then, 



yt 



where, fit 



TV 1 7T 2 TP 2 



n t 



TTiir 2 TP 2 



fX t ht 

i-.UtXt.iht_! 



7riP+l a 

^Utyt-i + n t 



n t 



(5) 



-U t nt-i +nt. Now we propose to decode 



the codeword U t as follows 



U t = arg min || y t 



o*-i 



-U t y t - 



(6) 



where, at-i can be estimated from the previous decision. Note 
that the above decoder does not require the knowledge of the 
channel gains and the relay matrices A\,i — 1, . . . , R. Also, 
note that this decoder is not a Maximum-Likelihood (ML) 
decoder since the equivalent noise vector fit is dependent on 
the previous codeword Ut-i through a t -i- However, for the 
collocated MIMO case, it has been shown in (6), Q that there 
is no significant performance loss due to this. In this setup, we 
call c g a DDSTC to distinguish it from DSTCs for collocated 
MIMO systems. 

III. Problem Statement 

In this section, we describe the problem statement of 
designing DDSTCs with low encoding complexity and low 
decoding complexity. 

In order to benefit from low encoding complexity and low 
decoding complexity, we propose to choose the DDSTC c € to 
be a g— group encodable (g > 1) linear STBC. 

Definition 1: A linear design S(si, S2, ■ ■ ■ , sk) of size nxn 
in K real indeterminates or variables s\, S2, ... ,Sk is a nx n 
matrix with entries being a complex linear combination of the 
variables s\, S2, . . . , sk- To be precise, it can be written as 
follows: 



S(S1, S2, . . . , Sk) = V_ SiBi 



K 

£• 

i=l 



where, Bj e C" x " are square complex matrices called 
the weight matrices. A linear STBC ^ is a finite set of 



nxn complex matrices which can be obtained by taking 
a linear design S(s 1; s 2 , ■ ■ ■ , sk) an d specifying a signal 
set srf C M. K from which the information vector x = 
[ si S2 ... sk 1 take values, with the additional con- 
dition that S(a) ^ S(a'), V a ^ a' 6 . A linear STBC 
^ = {S(x)|x e s/} is said to be g-group encodable (or 
real symbol encodable or J| complex symbol encodable) if 

x s/ g where each 



sYi x s/2 x 



g divides K and if s/ 
s/i,i = l,...,g C KT , 

Thus, given a g-group encodable STBC, it defines a nat- 
ural partitioning of the real variables of its associated lin- 
ear design into (/-groups. This partitioning defines a par- 
titioning of the set of weight matrices of S(x) into g- 
groups, the fc-th group containing K/g matrices. For sim- 
plicity we assume the simplest partitioning of the infor 



mation symbol vector as x = [ xi T X2 1 

where, Xk = s (fc-i)g 

L 9 
S(x) can be written as, 



+1 



S (k-l)K 



+ 1 



S(x) = ^Sk(xk) where, S k (x k ) = 



k=l 



E 

{k-l)K 



Now 



SjBi. 



+ 1 



Minimizing the decoding metric corresponding to (|5), 

1 



yt 



Ot-l 



-S(x)y t - 



is in general not the same as minimizing 

1 



yt 



Ot-l 



-S k (x k )yt-i 



(7) 



(8) 



for each 1 < k < g individually. However if it so happens, 
then the decoding complexity is reduced by a large extent. 
Definition 2: A linear STBC ^ = {S(x)|x € sf) is said 



to be g-group decodable (or — real symbol decodable or j- 
complex symbol decodable) if it is g-group encodable and if 
its decoding metric in <___> can be simplified as in (0. 

Theorem 1: |[T_1 The decoding metric in (0 can be sim- 
plified as in ([8]) if 



B^B, 



b/b ; 







(9) 



for all weight matrices Bi and Bj belonging to two different 
groups. 

We illustrate the significance of the above definitions con- 
cerning encoding and decoding complexity by giving few 
examples. 

Example 3.1: Consider the Golden code for 2 transmit an- 
tennas. It has 8 real variables. For the coherent MIMO channel, 
the signal set used is QAM for each complex variable. Hence, 
this code is a 8-group encodable (since QAM is a Cartesian 
product of two PAM signal sets) and 1 -group decodable linear 
STBC. However, if we now impose the requirement for unitary 
codewords, then we will have to solve for signal sets which 
will yield unitary codewords inside the division algebra. This 
may amount to entangling all the 8-real variables which will 
make the code 1 -group encodable and 1 -group decodable. This 
approach was recently attempted in J9)- 
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Example 3.2: Let us take the example of the Alamouti code 
for 2 transmit antennas. It has 4 real variables. If the signal 
set is chosen to be PSK for every complex variable, then all 
the codewords become unitary matrices. Hence the resulting 
code is 2-group encodable and 2-group decodable. However, 
note that if we take square QAM to be the signal set for 
each complex variable, then we get a 4-group encodable (since 
square QAM is a Cartesian product of two PAM signal sets) 
and 4-group decodable code, but now the codewords are scaled 
unitary matrices as opposed to unitary matrices. Thus relaxing 
the codewords to be scaled unitary matrices allows us to lower 
the encoding and decoding complexity. 
The above two examples show that the choice of signal sets is 
crucial in obtaining low encoding and decoding complexity. 
Example 13.21 explicitly shows how allowing scaled unitary 
codebooks aids in reducing the encoding/decoding complexity. 

The DDSTC design problem is then to design a g-group 
decodable linear STBC 

^={S(X= [ Sl S 2 ... S K ])\x€Stf} 

of size T x T to be used at the source such that 

1) All codewords are scaled unitary matrices satisfying the 
transmit power constraint. 

2) The parameters K and g are maximized. Increasing K 
is motivated by the need for high rate transmission and 
maximizing g is motivated by the requirement for low 
decoding complexity. 

3) There exist R unitary matrices Ai,A2,...,Ar 
of size T x T such that the first M of them 
satisfy A;C = CA ; , i = 1, . . . ,M, VCe? 
and the remaining R — M of them satisfy 
AiC* =CA i( i = M + l,...,R, VCetf. 

4) There exists an initial vector s such that the initial 
matrix X is unitary. 

5) min Sli s 2 e^ |(S X - S 2 ) H {S 1 - S 2 )| is maximized, i.e., 
the coding gain is maximized. 

Observe that the requirements for designing DDSTCs are 
much more restrictive than that for DSTCs. Note that condition 
(0 and condition are not required for designing DSTCs. 
As an additional requirement it would be nice to have a single 
design S(xi, X2, ■ ■ ■ , xk) and a family of signal sets, one for 
each transmission rate such that all the required conditions 
are met. This means that we need to be able to find R relay 
matrices satisfying the required conditions irrespective of the 
size of the code \&\. 

IV. Explicit construction of DDSTCs 

In this section we give an explicit construction of full 
diversity 4-group decodable DDSTCs for all power of two 
number of relays. Optimizing the coding gain for the proposed 
DDSTCs is difficult in general and is beyond the scope of this 
paper. Constructing g— group decodable DDSTCs for arbitrary 
g appears to be rather difficult and a brief discussion on the 
issues involved is given is Section [VT] In Subsection IIV-AI the 
construction of the proposed linear designs using tools from 
extended Clifford algebras is given. In Subsection IIV-BI an 
explicit construction of signal sets leading to full diversity 



for arbitrary transmission rate is provided for the linear designs 
given in Subsection IIV-AI thus completely describing the 
construction of the linear STBC c & . In Subsection IIV-CI we 
use algebraic techniques to explicitly construct the R relay 
matrices satisfying the required conditions for arbitrary size 
of the codebook ^ . 



A. Construction of linear design 

In this subsection, we briefly describe a construction of 
a class of rate one, linear designs satisfying the conditions 
for four-group decodability which were first obtained using 
extended Clifford algebras in [10|. Extended Clifford algebras 
and linear designs from them are addressed in detail in |10|. 
However [ 1 1 chooses signal sets as applicable for coherent 
communication in a distributed space-time coding setup. A 
brief explanation of extended Clifford algebras and linear 
designs using them is provided in Appendix [A] This alge- 
braic framework given in Appendix lAl is needed to explicitly 
construct the relay matrices and the initial vector in Subsection 

Construction 4.1: Given a n x n linear design 
A(xi, X2, ■ ■ ■ , xl) in L complex variables xi, X2, ■ ■ ■ , Xl, 
one can construct a 2n x 2n linear design D as follows: 

j-j A(xi,X 2 , . . . ,X L ) B(x L +l,X L +2 1 ■ ■ ■ ,X2l) 

B(x L +l, X L +2, ■ ■ ■ ,X2h) A(X!,X 2 , . . . ,X L ) 

where, the linear design B(xl+i, xl+2, ■ ■ ■ , %2l) is identical 
to the linear design A(xi, X2, . . . , Xl) except that it is a design 
in a different set of complex variables xl+x, xl+2, • ■ ■ , %2L- 
We call this construction as the 'ABBA construction' which 
was first introduced in lf8l . 

Construction 4.2: Given a n x n linear design 
A(xi, X2, . . . ,Xi), one can construct a 2n x 2n linear 
design S as follows. 

g_ A(xi,X 2 , . . . ,X L ) -~B H (xl+1,XL+2, ■ ■ ■ ,X2h) 

B(x L+1 ,X L+ 2, . . . ,X 2 l) A H (xi,X 2 , ■ ■ ■ ,X L ) 

where, the linear design B(xl+i, xl+2, ■ ■ ■ , %2l) is identical 
to the linear design A(xi, X2, . . . , xl). We call this construc- 
tion as the 'doubling construction'. 

We are now ready to describe the construction of the proposed 
linear designs. 

Construction 4.3: To obtain a linear design for R = 2 A 
relays, follow the steps given below. 

Step 1: Starting with the linear design [xi] in complex variable 
xi, keep applying ABBA construction iteratively on it till a 
2 A_1 x 2 A_1 linear design D is obtained. 
Step 2: Apply doubling construction on D and scale it by -4= 
to obtain the linear design. 

Example 4.1: Following the steps given above for R = 8 
relays, we get 
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(10) 



B. Construction of signal sets leading to full diversity 

In this subsection, we construct signal sets for the linear 
designs constructed in the previous subsection that lead to full 
diversity STBCs. In general, the signal sets should be designed 
such that the resulting linear STBC c & meets the following 
requirements. 

1) Scaled unitary codeword matrices meeting power con- 
straint. 

2) Four-group encodable and Four-group decodable. 

3) Difference of any two different codeword matrices 
should be full rank. We call such a code to be 'fully 
diverse' . 

4) The minimum determinant of the difference of any two 
codewords matrix should be maximized (Coding gain). 

We shall first illustrate the signal set construction procedure 
for 4 relays and derive important insights form it. Then, we 
generalize the ideas thus obtained for constructing signal sets 
for any R = 2 A relays. The design for 4 relays according to 
the construction in the previous subsection is given by 



1 

7t 



Xl 


x 2 


~X*3 
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x 2 


Xl 


#4 


* 

~ x % 


x 3 


X4, 


xl 


X 2 


X4, 


x 3 


X 2 


* 

x * 



(11) 



Let us look at S H S shown in ( fT2l at the top of the next page 
to find out the conditions on the signal sets under which the 
codewords are scaled unitary matrices. 

From (1121 1. we see that the signal set should be chosen such 
that the following condition is satisfied for all the signal points. 



X\X2 + X* 2 Xi + x \Xi + 17:4X3 = 0. 



(13) 



Firstly, we identify the grouping of the real variables into 
groups such that the corresponding weight matrices in each 
group satisfy According to the construction, the four 
groups or real variables are as follows: First group: {xn, X21}, 
Second group: {xiq,X2q}, Third group: {2:3/, xa}, Fourth 
group: {x3Q,XiQ}. It is important to choose signal sets 
such that they do not enforce joint constraints on variables 
from different groups. The requirement for scaled unitary 
codewords as in (13[ can be satisfied without disturbing 4- 
group encodability if all the signal points satisfy the following 
equation: 



XUX21 



-X\QX 2 Q = Ci, X31XU 



-X3QXAQ = C 2 (14) 



where, c\ and C2 are positive real constants. Then, the average 
power constraint requirement can be met by satisfying the 
conditions, 



L 2I) 

4) 



= 1, H X 1Q 



1. 



E(s§ 



L 2Q. 



= 1, 
= 1. 



(15) 



E(xjj 

^( x 3i ~r ^n) — J-, ^V^3Q "T -^iQ) 

A common set of solutions for (fl4l and (TTBT l can be obtained 
by taking points on the intersection of circles and hyperbolas. 
To meet the third requirement of full diversity we make use 
of the structure of the constructed designs. Note that 



lAS^ASI 



»|[ 



A A A A - 
AAAB 



AB H AB 
ABAA 



' AB 



(AAAB - ABAA)' 
AAAA fl + ABAB f 



_ 1 I [ AA H AA + AB f 
~ 4 I |_ AAAA* 

> imax(|AA|*|AA|, |AB|*|AB|) 2 





+ ABAB f 



(16) 

where, the notation A has been used to denote the difference 
matrix which will have the same form as the associated linear 
design. The second equality in ( |T6b follows because the linear 
designs A and B commute. This can be proved as follows. Let 



A = 



W 
X 



X 

w 



and B = 



W 

X' 



X' 

w 



Then, we have 
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± 

is fully 
we have 



Thus we observe that linear designs A and B commute if 
their constituent sub-matrices commute. Applying this argu- 
ment recursively, the claim follows. 

Thus S is fully diverse, i.e 
if either of the submatrices A 
diverse. For the example of 4 

Ax2 Axi 

Hence we can guarantee full diversity by ensuring that 
Axi ^ ±Ax2 and AX3 7^ ±Ax4. Just like before, we 
should be careful not to disturb 4-group encodability in the 
process. We take care of that requirement also by satisfying 
the following conditions: 



|AA| = 



(An 



|AS| 
or B 
relays, 

Ax 2 )(Aa;i - 



Axi/ ^ ±Ax 2 i, Ax 1Q ^ ±Ax 2 q, 
Ax 3 i ^ ±Axn, Ax 3Q ^ ±AxiQ. 



(17) 



Putting together all the conditions that need to be met, 
we have to choose constellation points satisfying ( [Pfl ). ( fl3b 
and (1171 1. The solution can be found simply by finding the 
intersection of points on the unit circle x 2 + y 2 — 1 with a 
hyperbola xy = c, where c < 1 on the two dimensional xy 
plane. This is illustrated in Fig. [T] 

Observe from Fig. Q] that the hyperbola intersects the circle 
at four different points. But the full diversity criterion demands 
that Ax 7^ ±Ay. After enforcing this condition, only two 
points survive out of the four points. They can be either 
the set of points marked A or the set of points marked B 
in Fig. Q] Thus we have obtained a signal set containing 2 
points. Supposing that we need more than two points, we 
can draw more circles (centered at origin) with radii such 
that the average power constraint is met and then find those 
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X*X2 + X^X\ + X3X4 + 2J|2;3 
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(12) 



1.5 




Fig. 1. Signal set structure in 2 dimensions 



points intersecting with the hyperbola. More precisely, to get 
to points, we draw -P concentric circles with increasing radii 

ri, T2, . . . , ra such that X)j=i r ? = y- Then we find those 
points intersecting with the hyperbola xy = c where, c is 
a positive number less tharQ r\. In this manner we can get 
the desired signal set for the variables xn,X2i and xai,X4j. 
The signal set for the variables X\q,X2Q and x^q,X4q can 
be obtained by considering a different hyperbola. This is 
illustrated in Fig. [2] shown at the top of the next page. 

Based on empirical studies, we propose to choose points in 
the set A and set B alternatively on each circle. Also, choosing 
the constants c\ = C2 = seems to give the largest coding 
gain. Similar observations have also been made in [7| for 
another class of linear STBCs applicable in co-located MIMO 
systems. Implementing these two suggestions makes the signal 
identical for all the 4-groups and the resulting signal set for 
the case of 8 points is shown in Fig|3] Note that the proposed 
signal set construction is far from general and the most general 
solution would correspond to the following equations: 



X11X21 = -xiqx 2 q 
E(x 2 u + x 2 2I ) -- 

E(x§ J + x 2 AI ) = d 3 , E(xl + x 



= Ci, X37X47 

di 



^( X 1Q + X 2Q 



X3QX4Q = C 2 , 



Axu ^ ±Ax 2 i, Axiq ^ ±Ax 2 q, 
Ax 3 i ^ ±Ax±i, Ax 3Q ^ ±Ax4 Q . 

In general, we need to choose the parameters 
Ci, C2, di, di, da, (^4, ri, . . . , ra and the solution points 
such that the coding gain is maximized. 

' This condition is necessary since otherwise the hyperbola will not intersect 
the circle with least radius. 



0.5 



=~ 



-0.5 



-1.5 




-1.5 



-0.5 



0.5 



1.5 



Fig. 3. Proposed signal set in 2 dimensions 



Based on the detailed illustration of signal set construction 
for 4 relays antennas, a natural generalization of it to higher 
dimensions is given in Construction I4.4I as follows. 



setc 



Construction 4.4: Suppose that we want a Q points signal 



for the constructed design for R 
Then the resulting signal set C R 2 



2 relays, 
should be a Cartesian 
product of 4 signal sets in IR 2 since we insist on 4-group 
encodability. In our case, we choose all the four sets to be 
identical and each will contain tfQ points. Let the signal 
points in M 2 be labeled as p», i = 1, . . . , \fQ. If i = 2q+r, 
for some integers q and r where 1 < r < 2, then p; is given 

by 



Pi\j] = Vj ^ (q mod 2 A 
Pi[(q mod 2 A -!) + 1] = +r q+ 
Pi[(q mod 2 A -!) + 1] = -r q+ 



) + l 
if r = 1 
if r = 2 



(18) 



where, 



> n 



2=1,.. 

Vt = 1, . 



are positive real numbers such that 

..f-landEj^-W 



2 i-*%=\ ' 1 2 ' 

Example 4.2: Let R = 2 3 = 8 and Q = 16 4 . Thus the 
rate of transmission of this code will be log | — = 2 bits per 
channel use. The design is given in ( fTOb . The corresponding 
four dimensional signal set is described below. 
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Fig. 2. General signal set for four relays 
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(19) 

where, ri = 0.3235, r 2 = v^^i, r 5 — 3ri, r 3 = r 2 + '' 5 g r2 , 
r 4 = r 2 + 2^, r 6 = (2 + V3) r u r T = r 3 + 2r x , 
rg = r4 + 2r%. Though the dimension of the signal set is 
4, due to the special structure of the signal set, we can study 
the two dimensional projections of the signal points which is 
graphically shown in Fig. [4] at the top of the next page. 

The proposed multidimensional signal sets in Construction 
14.41 are far from generality and obtaining a general solution 
targeting to maximize the coding gain appears to be difficult. 

Theorem 2: Construction l4.4l provides fully diverse signals 
sets for the designs given in Construction 14.31 

Proof: The constructed designs for 2L complex variables 
has the structure as shown in Construction 14.21 The partition- 
ing of the real variables into four groups is as follows. 

1) First group : {x u , x 2 i, x LI } 

2) Second group : {xiq,x 2 q, ■ ■ -,x LQ } 

3) Third group : {x (L+1)I , X( L+2 )i , ■ ■ ■ , x 2 li} 

4) Fourth group : {x {L+1)Q .x (L+2)Q , x 2LQ ) 

The signal set is a Cartesian product of 
smaller dimensional signal sets C M. L . 
A H A + B ff B ^ 

AA H + BB H 

the designs A and B are identical, it is sufficient to show 
that the design A(xi,x 2 , ...,xl) gives rise to fully diverse 



S H S = 



four 
R-k. Since 

and because 



scaled unitary codewords for the chosen signal set. Let the 
given signal set be denoted by A C M. L . Then the following 
properties are satisfied V x, y £ A, x ^ y. 

x[i\x[j] = 0, 1 < i ^ j < L (20) 
1<|{*| (x-y)[i\ ^0} | < 2 (21) 
(x-y)\i] ? ±(x-y)[j], i^j (22) 

The design A(xi, x 2 , . . . , x£) can be obtained by 
repeatedly applying ABBA construction on the linear 
design [x{\. The off-diagonal entries of the matrix 
A(xi,x 2 , . . . ,xl) h A(xi,x 2 ,. . . ,xl) are sum of 
j + x*Xi, i j. But we have, 

which is equal to 
zero for the signal set A by virtue of d20l . Thus we are 
guaranteed of scaled unitary codewords. 

Due to the ABBA structure of the linear design 
A(xi,x 2 , . . . ,xl) , the determinant can be obtained in an 

iterative manner. Let A(xi,x 2 , . . . ,xk) - 

Then we have 



terms like x*Xj 

X,- Xj XjXi — 2 (xijXjJ ~t~ XiQXjQ 



Ai 
A 2 



A 2 
Ai 



I 
I I 



A x A 2 
A 2 Ai 



I 
I I 



Ai - A 2 A 2 
Ai + A 2 
A 1 +A 2 |)(|A 1 -A 2 |). 

The above equation suggests that |A(xi,a;2, 
can be computed recursively. For L = 2, 
|A(xi, x 2 )\ — (xi +X2)(xi — x 2 ). Using this, we can easily 
show that |A(xi, X2, . . • , Xi)| is a product of L terms. A 
typical term in the product looks like (xi ± x 2 ± • • • ± xl). 
Thus to ensure | AA(xi, x 2 , ■ ■ ■ , Xl)\ ^ 0, each term 
in the product should not equal zero. Thus we need 
(Axi ± Ax2 ± ■ • • ± Axl) 0. Let us look at the real part 



IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS , VOL. XX, NO. XX, XXXX 




X 3 



Fig. 4. Two dimensional projections of four dimensional signal set for Eight relays 



of this expression. We get (Axu ± AX21 ± • ■ ■ ± Ax^j). 
For the given signal set A, from (fSTJ and (l22l i. we have 
(Axu ± AX21 ± • • • ± Axlj) 7^ 0. Thus the theorem is 
proved. ■ 

C. Construction of Relay Matrices and Initial vector 

Theorem 3: For the linear designs given by Construc- 
tion 14.31 there exist R relay matrices satisfying condition 
(O for arbitrary signal sets. Moreover, if the initial vector 
so = [ 1 ... ] , then the initial matrix Xo given in 
(ffji becomes unitary. 

Proof: See Appendix IE1 ■ 

Example 4.3: Let R = 4. Then the DDSTC ^ is obtained 
using the design shown in ([TT1 i and the signal set given in 
Construction 14.41 The signal set is a Cartesian product of 
four 2-dimensional signal sets. The relay matrices are given 
as follows: 



Ai = I 4 , A 2 = 



A, 







1 
1 



-1 

-1 







10 
10 
1 
10 








0-1 

1 

1 



The initial vector so = [ 1 ... ] and the initial 
matrix Xo = I4. This DDSTC is single complex symbol 
decodable (or 2 real symbol decodable). 

V. Simulation Results 
In this section, we compare the error performance of the 



proposed DDSTC in Example 14.31 with that of the circulant 
codes in |3| and the cyclic codes in for transmission 
rates of 1 bit per channel use (bpcu) and 1.5 bpcu. The 



signal set chosen for the proposed code is as given by 
Construction 14.41 with parameters r\ — 7^ r 2 = \J\ f° r 
a transmission rate of 1 bpcu and for the case of rate=1.5 
bpcu, the signal set parameters are i\ — 0.378, r 2 — 0.8452, 
7-3 = 1.1339 and r 4 = 1.3628. The circulant code of is 
given by {uiAi, it 2 A 2 , u 3 A 3 , it 4 A 4 |uj e ^i, i = 1, . . . , 4}. 
The relay matrices Ai are taken to be powers of the 4x4 
circulant matrix. The signal set J^i is chosen to be 9{ radians 
rotated version of 64-PSK for a transmission rate of 1 bpcu 
and 1024-PSK for a rate of 1.5 bpcu. The rotation angles 
are chosen as follows to guarantee full diversity: 6\ = 0, 
f?2 = 1.5, 63 = 3, 84 = 4.5. Another reference for comparison 
is the cyclic codes proposed in JJj whose codewords for 
rate=l bpcu and rate=1.5 bpcu are given as follows: 




e256 






r 67 

S.256 





A877 
S4096 



and 







AlOl 
^256 





A2039 
S4096 

C4096 = 



= 0, 



,255 



, , 4095 



and 



esi. For simulations, we 
have assumed a block fading channel which is quasi-static for 
800 channel uses and varies independently from one block to 
another. 

Fig. [5] shows the error performance curves of the proposed 
codes in comparison with those of Q, Q. It can be observed 
from Fig. [5] that for a transmission rate of 1 bpcu, the 
proposed code outperforms the cyclic code by about 5 dB 
and the circulant code by more than 10 dB. Similarly for 
a transmission rate of 1.5 bpcu, it can be observed that the 
proposed code outperforms the cyclic code by about 5 dB and 
the circulant code by more than 25 dB. It is worthwhile to 
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Fig. 5. Error performance comparison of the proposed codes with those of 
and (5] 

note that for rate=l bpcu, the decoding search space for the 
proposed code is only 4 whereas it is 256 for the other two 
codes. Similarly for rate=1.5 bpcu, the decoding search space 
for the proposed code is only 8 whereas it is 4096 for the other 
two codes. Because of the exponential growth of decoding 
search space with increasing transmission rate for the codes 
in 0, (5), it takes a prohibitively large time to complete a 
error performance simulation for higher rates. 

VI. Discussion 

We have thus constructed a class of four group decodable 
DDSTCs for any power of two number of relays using 
algebraic techniques. It is important to note that relaxing the 
unitary matrix codebook to scaled unitary matrix codebook 
has aided us in obtaining decoding complexity benefits. A 
limitation of this algebraic method is that it is available only 
for power of two number of relays. Extending this method to 
obtain g-group decodable DDSTCs for g > 4 appears to be 
difficult for two reasons: 

1) The rate of the known linear designs in the literature 
for g > 4 fall below 1 and obtaining R unitary relay 
matrices satisfying Condition (01 is difficult and it is not 
clear to the authors whether such matrices exist at all. 
For example, constructing the relay matrices satisfying 
Condition ([3]) for orthogonal designs is difficult. 

2) Even if the relay matrices are obtained, an initial vector 
satisfying Condition (01 may not exist for the chosen 
relay matrices. 

For the specific case of g = 4, things were possible because the 
linear designs correspond to left regular matrix representation 
of an algebra over C and hence using algebraic techniques, 
it was possible to construct the relay matrices and the initial 
vector satisfying all the required conditions. 

Optimizing the signal sets for coding gain might be possible 
for small number of relays and for small number of points. 
However it becomes difficult for larger number of relays and/or 



larger number of points because it involves many parameters 
and a general closed form solution is difficult. Hence optimiz- 
ing the signal sets for coding gain is an important direction 
for further work. 

Appendix A 

Construction using Extended Clifford Algebra 

Definition 3: Let L = 2 a ,a e N. An Extended Clifford 
algebra denoted by A^ is the associative algebra over R gen- 
erated by n + a objects jk, k = 1, . . . , n and r5j, i = 1, . . . , a 
which satisfy the following relations: 

• 7fc = -1) Vfc = l,...,n; 7^ = -7^7*, V k ^ j 

• Ofr = 1, V k = 1, . . . , a; S^Sj = SjSk, V 1 < fc, j < a 

• 8klj = 7j4, V 1 < k < a, 1 < j < n. 

The classical Clifford algebra, denoted by Clif '/„, is obtained 
when only the first two relations are satisfied and there are no 
Si. Thus Clif f n is a sub-algebra of A„. Let SS n be the natural 
R basis for this sub-algebra. 

^„U{^ n Si\i = l,...,a} 
\f m =2@n{T[7=M±<h<k i+1 <a} 

Then a natural R basis for is: 

^n= {l}U{7i|* = !,•••. n} 

UUa{Il£i7fc,|l<*i <**+!<«}■ 
We need a unitary matrix representation for 
the symbols 1, 71, 72, 7172, 6 k , k = l,...,a, 
Um=2 < fci < fei+i < a in the algebra Af. 

Such matrices are naturally provided by the left regular 
representation. 

We first view A2 as a vector space over C by thinking of 
71 as the complex number i = y— T. A natural C basis for 
A2 is given by: 

B£= {1, 72} U{{1, 72 RN = a} 

U™= 2 {l,72}{a™i<yi<fc i <h+x <a}. 

Thus the dimension of Aj seen as a vector space over C is 
2 n+a_1 . We have a natural embedding of Aj into Endc(A2 ), 
(the set of all C-linear maps from Aj to itself) given by left 
multiplication as shown below. 

cj) : A% i-> End c (A^) 
(f>(x) = L x : y h-> xy. 

Since L x is C-linear, we can get a matrix representation of 
L x with respect to the natural C basis B^. Left regular 
representation yields unitary matrix representations for the 
required symbols in the algebra. The resulting linear designs 
are precisely those given by Construction 14.31 This has been 
explicitly shown in more detail in iflOll . 

Appendix B 
Proof of Theorem[3] 

Proof: Let us prove the theorem assuming that all the 
complex variables of the linear design take values from the 
entire complex field. We use the fact that the linear design for 
R = 2 A relays was obtained as a matrix representation of the 
Extended Clifford algebra A| . Thus a = A — 1. We choose 
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M = 2 a — -j. The M relay matrices are explicitly given by 
the union of the elements of the sets {4>(1), <K^i), ■ • ■ , </>(£ a )} 
and {Um =2 nili^(4J|l < h < k i+ i < a}. By virtue of 
the property that is a ring homomorphism, these matrices 
are guaranteed to commute with all the codewords because 
they are matrix representations of elements belonging to the 
center of the algebra Aj . To obtain the remaining R — M 
relay matrices, we need to find unitary matrices which satisfy 



AjC* = CA h i = M + l,. 



,R 



(23) 



where, C is any codeword. But the codeword C is simply 
a matrix representation of some element belonging to the 
Extended Clifford algebra. One method to get these relay 
matrices is to take them from within the Extended Clifford 
algebra itself. By doing so, we can translate the condition in 
(1231 into a condition on elements of the algebra which will 
then provide us a handle on the problem. Towards that end, 
we first identify a map in the algebra which is the analogue of 
taking the conjugate of the matrix representation of an element. 
Note from Appendix [A] that we used the fact that 71 can be 
thought of as the complex number i = When we take 

the conjugate of a matrix, we simply replace i by —i. Hence 
the analogue of this action in the algebra is to replace 71 by 
—71. Thus, the analogous map a in the algebra is defined as 
follows: 

a : x I— > x (24) 

where, the element x is obtained from x by simply replacing 
71 by —71 in the expression of x in terms of the natural M- 
basis of Extended Clifford algebra. Now the problem is to find 
R — M distinct elements denoted by clu i = M + 1, . . . , R 
of the algebra A2 which satisfy aiX = xa,i, V x € 
The elements of the union of the following two sets satisfy 
the above required condition. 



{-y 2 {l,S 1 ,...,5 a }},l 72 I Q n^.l 1 ^ h ^ + X ^ a 



This can be proved by using the anti-commuting prop- 
erty, i.e., 72 (—71) = 71(72)- Hence the matrix rep- 
resentation of these specific elements gives the unitary 
relay matrices Ai,i = M + 1, ...,R. Suppose we 
plug in these relay matrices to form a linear design 
X = [ Ais . . . Ams Am+iS* . . . Ars* ] where, 

s = [ x\ X2 ... xr ] T , it turns out that we get exactly 
the same linear design which is used at the source. Thus the 
initial vector choice of So = [ 1 ... ] guarantees 
that the initial matrix Xo is an identity matrix. 

However, we would like to point out that there 
are also other elements of the algebra which satisfy 
a,iX — Xdi, V x € Aj . For example, consider the union 
of the elements of the sets {7172 {1, Sx, ■ ■ ■ , S a }} and 

{7172 {[j a m=2 Til"! < *i < fc *+l < «}}■ " 
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